The etiology and pathogenesis of idiopathic interstitial lung disease (ILD) remain incompletely understood.
Genetic susceptibility to ILD has been demonstrated in previous studies. It is well known that EGFR
inhibitors can induce ILD in human lung cancer patients, with differences in incidence between ethnic
groups, which prompted us to hypothesize that genetic variation in EGFR pathway genes confers
susceptibility to ILD. In this study we investigated whether functional polymorphisms of EGFR and its
ligand genes (EGF and TGFA) were associated with ILD. Three EGFR [-216G/T (rs712830), -191A/C (rs712829),
497R>K (A/G) (rs2227983)], one EGF [61A/G (rs4444903)] and one TGFA (rs3821262C/T) polymorphisms
previously demonstrated to alter gene function were genotyped in 229 sporadic idiopathic ILD patients
and 693 healthy individuals. Allelic and genotypic association tests between these polymorphisms and ILD
were performed. The EGF 61A/G polymorphism was significantly associated with elevated risk of ILD, with
the frequency of the A allele significantly increased in the ILD patient population (OR = 1.33,
95% CI = 1.07-1.66, P = 0.0099). None of the other polymorphisms was associated with risk of ILD. Our
study suggests that the EGF 61A/G polymorphism may be associated with sporadic ILD. Because a false
positive finding cannot be excluded, independent studies are warranted to validate this result.



Interstitial lung disease (ILD) refers to a broad range of chronic lung disorders with diverse pathogenesis and
complex histopathology, together accounting for 15% of respiratory care practice. Most entities manifest
as epithelial injury, followed by fibroblastic proliferation and the development of fibroblastic foci with
exuberant deposition of matrix, the typical hallmarks of pulmonary fibrosis. Over two thirds of ILD cases do not
have a known cause and are thus named idiopathic interstitial pneumonia (IIP). Although the incidence of ILD in
the US is low (approximately 30 cases per 100,000 persons per year), the disease can be progressive and fatal. The
mean survival time of ILD patients is only about 3 years. The etiology and pathogenesis of most ILD entities
remain unknown, which greatly hampers the development of therapeutics for the disease. To date, no
proven drug therapy has been recognized for most entities.

It is now widely accepted that the development of ILD has a strong genetic basis. Substantial evidence
demonstrates that ILD is a heritable complex disease determined by genetic factors with involvement of
environmental stimuli, such as tobacco smoke. Family-based studies have been conducted in an attempt to identify
genes predisposing to ILD, and causal mutations have been identified in several genes, e.g. the telomerase reverse
transcriptase gene (TERT), the telomerase RNA component gene (TERC), and the surfactant protein A2 (SPA2) and C
(SPC) genes. More recently, a few genome-wide association studies have identified a number of single
nucleotide polymorphisms (SNPs) located at or close to the TERT, TERC, MUC5B, FAM13A, DSP,
OBFC1, ATP11A, DPP9, TOLLIP and SPPL2C genes that are significantly (P < 5 × 10^-8) associated with IPF
and/or IIP as an overall phenotype. However, these polymorphisms together were estimated to account for
only about one third of the risk of IIP, suggesting that additional genetic components are yet to be identified.

The epidermal growth factor receptor (EGFR) is a tyrosine kinase receptor for various growth factors including
EGF (epidermal growth factor), TGF-α (transforming growth factor-α) and other EGF-like ligands. The EGFR
pathway plays an important role in pulmonary physiology, especially in the function of epithelial cells, via signal
transduction that regulates key cellular processes such as self-renewal, wound healing, proliferation, survival,
adhesion, migration and differentiation. EGFR inhibitors have been widely used in the treatment of non-small cell
lung cancer (NSCLC). However, ILD has been consistently reported as one of the uncommon but severe adverse
reactions to EGFR inhibitors. A strong association between the incidence of ILD and anti-EGFR treatment
has been reported in a large case-cohort study that included over 4,000 subjects. The study showed a 3.23-fold
increase in the risk of ILD in patients who received gefitinib compared with those who underwent conventional
chemotherapy. Furthermore, significant inter-ethnic differences in the incidence of ILD in patients treated
with EGFR inhibitors have been consistently observed. According to the U.S. Food and Drug Administration (FDA),
an overall ILD incidence of 1% was observed in 50,005 patients receiving gefitinib, including 18,960 patients
from Japan and 23,000 from the U.S. Interestingly, the incidence of ILD was higher in Japanese patients
(1.7%) than in patients from the U.S. (0.3%). There was also a significant difference in the median time to
onset (TTO) of ILD between Japanese and U.S. patients: about 24 days in the former but around 42 days in the
latter. These findings have been confirmed in other independent studies. Taken together, these
observations suggest that certain genetic factors related to the EGFR pathway may confer susceptibility to ILD
in general. To test this hypothesis, we set out in this study to examine the genetic association between
functional polymorphisms in the EGFR, EGF and TGFA genes and ILD. These polymorphisms have previously been
demonstrated to alter gene expression, function or other related phenotypes in our own and others' studies.

Table 1 | Demographic and covariate data for ILD patients and controls

                   Total     Gender     Age            Cigarette smoking (N)        BMI
Disease status     number    Male (N)   Mean ± SD      Current   Ever    Never      Mean ± SD
UIP/IPF            84        55         64.4 ± 7.7     1         52      31         30.3 ± 5.2
NSIP               27        10         59.1 ± 8.4     1         15      11         30.5 ± 6.8
DIP                9         7          54.1 ± 7.7     3         4       2          31.3 ± 8.7
RB-ILD             22        13         58.1 ± 11.5    2         17      3          32.5 ± 7.1
COP                10        7          62.2 ± 11.9    2         6       2          29.0 ± 6.7
HP                 8         1          52.0 ± 16      0         1       7          31.7 ± 5.7
LIP                1         1          49             0         1       0          27.4
AIP                1         0          74             0         0       1          26.6
UF                 67        40         64.8 ± 10.4    1         45      21         29.4 ± 5.6
Control            693       360        55.7 ± 13.2

SD, standard deviation; BMI, body mass index.

Methods 

Ethics statement. Research in this study was performed on samples from anonymous adult individuals
without any intervention in patients, and is therefore not considered to involve 'human subjects.'
Samples were collected with written informed consent obtained from participants and with the approval
of the institutional review boards (IRBs) at the Lung Tissue Research Consortium (LTRC,
http://www.ltrcpublic.com) and The University of Chicago. The study was carried out in accordance with
the guidelines approved by the Purdue University IRB (approval number 1307013815) and was in
compliance with the Helsinki Declaration.

Study population. DNA samples extracted from peripheral blood of ILD patients (n = 227)
were obtained from the Lung Tissue Research Consortium (http://www.ltrcpublic.com).
All patients were diagnosed with ILD in accordance with the American Thoracic
Society/European Respiratory Society International Multidisciplinary Consensus
Classification of the Idiopathic Interstitial Pneumonias, with well-documented clinical
data, computed tomography (CT) scans and pathological review of lung biopsies for all
patients. Cases with a known cause for the disease were excluded. The DNA samples
came from patients who had been diagnosed with idiopathic pulmonary fibrosis (IPF)
(n = 84), non-specific interstitial pneumonia (NSIP) (n = 27), desquamative interstitial
pneumonia (DIP) (n = 9), respiratory bronchiolitis-interstitial lung disease (RB-ILD)
(n = 22), cryptogenic organizing pneumonia (COP) (n = 10), hypersensitivity
pneumonitis (HP) (n = 8), lymphocytic interstitial pneumonia (LIP) (n = 1), acute
interstitial pneumonia (AIP) (n = 1) and uncharacterized fibrosis (UF) (n = 67).
Control DNA samples (n = 693) were collected and provided by the Translational
Research Initiative in the Department of Medicine (TRIDOM) program at the
University of Chicago. These samples came from patients regularly visiting the clinic,
excluding individuals with any respiratory symptoms according to the ICD-9
classification. All control subjects are self-reported Caucasians. Demographic data for
ILD patients and controls are summarized in Table 1.

Genotyping. The five selected polymorphisms, three in EGFR [-216G/T (rs712830), -191A/C (rs712829),
497R>K (A/G) (rs2227983)], one in EGF [61A/G (rs4444903)] and one in TGFA
(rs3821262C/T), were genotyped using the polymerase chain reaction-restriction
fragment length polymorphism (PCR-RFLP) method. Primers were designed as described
previously. The genotyping procedure followed a previously reported protocol. For
quality control, 10% of the cases and controls were selected for PCR-sequencing to
confirm the genotypes of all polymorphisms. All samples genotyped by both PCR-RFLP
and sequencing had concordant genotypes.

Statistical analysis. A chi-squared test was used to evaluate the Hardy-Weinberg
equilibrium (HWE) of each polymorphism, as well as the allelic association between
each polymorphism and ILD. HWE was tested with df = 1. A multiple logistic
regression model was also fitted with all five SNPs, age and gender as covariates,
giving a total of 7 predictors in the model. To select potentially important
variables, the "PROC GLMSELECT" procedure in SAS was employed as previously
suggested. Using the Bonferroni method to control the family-wise error rate, two
significant variables, EGF 61A/G and age, were included in the final model. In
addition, we performed a Cochran-Armitage trend test to investigate a potential
additive trend in the association between EGF 61A/G genotypes and ILD. All hypothesis
tests were two-sided, and a P value less than 0.01 (0.05/5) was considered
statistically significant. Data analyses were performed using SAS 9.3 (SAS Institute,
Cary, NC) and PLINK.
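
As a minimal sketch of the trend test described above (not the SAS or PLINK code used in the study), the Cochran-Armitage statistic can be computed directly from genotype counts. The function name, the additive weights (0, 1, 2) and the use of the normal approximation are assumptions made for illustration; the counts are the EGF 61A/G genotype counts given in Table 2. The resulting P value may differ slightly from the reported one, depending on the variance convention and the software used.

```python
import math

def cochran_armitage_trend(cases, controls, weights=(0, 1, 2)):
    """Two-sided Cochran-Armitage trend test for a 2 x k table.

    cases, controls: counts per ordered genotype category.
    Returns (z, p) based on the normal approximation.
    """
    R, S = sum(cases), sum(controls)
    N = R + S
    cols = [r + s for r, s in zip(cases, controls)]
    # Test statistic: weighted difference between case and control counts.
    T = sum(w * (r * S - s * R) for w, r, s in zip(weights, cases, controls))
    # Variance of T under the null hypothesis of no trend.
    term1 = sum(w * w * n * (N - n) for w, n in zip(weights, cols))
    term2 = 2 * sum(
        weights[i] * weights[j] * cols[i] * cols[j]
        for i in range(len(cols))
        for j in range(i + 1, len(cols))
    )
    var = (R * S / N) * (term1 - term2)
    z = T / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return z, p

# EGF 61A/G genotype counts from Table 2, ordered G/G, G/A, A/A
z, p = cochran_armitage_trend(cases=[28, 109, 90], controls=[126, 341, 219])
bonferroni_alpha = 0.05 / 5  # five polymorphisms tested
print(f"z = {z:.2f}, P_trend = {p:.4f}, below threshold: {p < bonferroni_alpha}")
```

The weights encode an additive model in which each copy of the A allele shifts risk by the same amount; other genetic models would use different weights.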

Results 

We used a case-control study design to examine whether polymorphisms
of EGFR and its ligand genes were associated with susceptibility to
ILD. The observed genotype frequencies of these SNPs
were all in agreement with HWE in the control subjects (df = 1,
P > 0.05 for all tests, data not shown). The genotype and allele
distributions of the five SNPs in the cases and controls are
summarized in Table 2.
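
The HWE check can be illustrated with a short sketch. It is not the authors' code: the function name is hypothetical, and the P value is obtained from the closed-form chi-squared survival function for df = 1. The inputs are the control genotype counts listed in Table 2, given as (homozygote, heterozygote, homozygote); the test is symmetric in the two homozygote classes, so their order does not matter.

```python
import math

def hwe_chi_square(n_hom1, n_het, n_hom2):
    """Chi-squared goodness-of-fit test for Hardy-Weinberg equilibrium (df = 1)."""
    n = n_hom1 + n_het + n_hom2
    p = (2 * n_hom1 + n_het) / (2 * n)  # frequency of the first allele
    q = 1 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_hom1, n_het, n_hom2)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# Control genotype counts from Table 2
controls = {
    "TGFA rs3821262C/T": (154, 330, 206),
    "EGF 61A/G": (126, 341, 219),
    "EGFR -216G/T": (304, 309, 80),
    "EGFR 497G/A": (387, 250, 48),
    "EGFR -191C/A": (12, 166, 510),
}
results = {name: hwe_chi_square(*counts) for name, counts in controls.items()}
for name, (chi2, p_value) in results.items():
    print(f"{name}: chi2 = {chi2:.3f}, P = {p_value:.3f}")
```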

A significant association was observed between the EGF 61A/G
polymorphism and ILD (OR = 1.33, 95% CI = 1.07-1.66, P =
0.0099), with the A allele frequency being significantly higher in
the cases (64%) than in the controls (57%). In addition, there appeared
to be an additive effect of the A allele on the phenotype: the odds
ratio for the A/A group relative to the G/G group was larger than that
for the G/A group relative to the G/G group (G/A vs G/G:
OR = 1.44, 95% CI = 0.91-2.29; A/A vs G/G: OR = 1.85, 95% CI =
1.15-2.98; Ptrend = 0.0083) (data not shown). No significant
association was found between the other polymorphisms and ILD. After
adjusting for age and gender, the association between EGF 61A/G
and ILD remained significant (P = 0.0087) (data not shown).
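
For readers who want to retrace the allelic association, the following sketch computes an odds ratio with a Wald 95% confidence interval and a Pearson chi-squared P value from a 2 x 2 table. The function names are hypothetical and the Wald interval is an assumption about how such intervals are commonly obtained, not a statement of the method used in the study. With the EGF 61A/G allele counts from Table 2 (A vs G), the output should be close to the reported OR of 1.33 (1.07-1.66); the genotype-level intervals may differ in the last digit.

```python
import math

def odds_ratio_wald(a, b, c, d, z_crit=1.959964):
    """Odds ratio and Wald 95% CI for a 2 x 2 table.

    a, b: risk / reference counts in cases; c, d: the same in controls.
    """
    odds_ratio = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lower = math.exp(math.log(odds_ratio) - z_crit * se)
    upper = math.exp(math.log(odds_ratio) + z_crit * se)
    return odds_ratio, lower, upper

def chi_square_2x2(a, b, c, d):
    """Pearson chi-squared test (df = 1, no continuity correction)."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    return chi2, math.erfc(math.sqrt(chi2 / 2))

# Allele counts from Table 2: A and G in cases, A and G in controls
allelic_or, allelic_lo, allelic_hi = odds_ratio_wald(289, 165, 779, 593)
allelic_chi2, allelic_p = chi_square_2x2(289, 165, 779, 593)
print(f"A vs G: OR = {allelic_or:.2f} ({allelic_lo:.2f}-{allelic_hi:.2f}), P = {allelic_p:.4f}")

# Genotype contrasts against G/G (cases: 28 G/G, 109 G/A, 90 A/A; controls: 126, 341, 219)
ga_or, ga_lo, ga_hi = odds_ratio_wald(109, 28, 341, 126)
aa_or, aa_lo, aa_hi = odds_ratio_wald(90, 28, 219, 126)
print(f"G/A vs G/G: OR = {ga_or:.2f} ({ga_lo:.2f}-{ga_hi:.2f})")
print(f"A/A vs G/G: OR = {aa_or:.2f} ({aa_lo:.2f}-{aa_hi:.2f})")
```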

Table 2 | Association between EGFR pathway polymorphisms and ILD

Polymorphism         Genotype (N)          Allele (N)       RAF      OR      95% CI         P

TGFA rs3821262C/T    [genotype and allele labels illegible]
  ILD                50     122    57      222     236      0.485    1.09    [illegible]    0.41
  Contr.             154    330    206     638     742      0.462

EGF 61A/G            GG     AG     AA      G       A
  ILD                28     109    90      165     289      0.363    1.33    1.07-1.66      0.0099
  Contr.             126    341    219     593     779      0.432

EGFR -216G/T         GG     GT     TT      T       G
  ILD                109    98     21      140     316      0.307    0.87    0.69-1.09      0.22
  Contr.             304    309    80      469     917      0.338

EGFR 497G/A          GG     AG     AA      A       G
  ILD                123    91     15      121     337      0.264    1.06    0.84-1.35      0.62
  Contr.             387    250    48      346     1024     0.253

EGFR -191C/A         AA     CA     CC      A       C
  ILD                6      65     157     77      379      0.169    1.27    0.95-1.69      0.11
  Contr.             12     166    510     190     1186     0.138

N = number; RAF = rare allele frequency; OR = odds ratio; CI = confidence interval

Discussion

We observed a statistically significant association between ILD and an EGF
polymorphism that was previously demonstrated to alter EGF gene
expression. Although the association was weak, the result
remained significant after Bonferroni correction and after adjusting
for age and gender. The Cochran-Armitage test for trend also
suggested a possible additive effect of the risk allele. Unfortunately, our
result has not been validated in additional sample sets, so
independent confirmation of our finding is necessary. If
this result can be replicated, it would indicate that genetic
variation in the EGFR pathway may confer risk of ILD in general.

EGFR inhibitors (EGFRi), e.g. gefitinib and erlotinib, have been
widely demonstrated to induce ILD in lung cancer patients.
Although induction of ILD has been observed with many drugs, its
incidence was higher with EGFRi treatment than with chemotherapy.
Meanwhile, pre-existing ILD has been identified as one of the risk
factors for EGFRi-associated ILD. These lines of evidence suggest that
EGFRi-induced ILD may share similar pathobiology with ILD in general.
Unfortunately, EGFRi-associated ILD is rare, which makes it difficult
to test its association with genetic polymorphisms in the EGFR-axis
genes. Previous studies have demonstrated that EGFR and its ligands,
in particular EGF and TGF-α, exert an essential function in lung
epithelium and fibroblastic cells under both normal and fibrotic
conditions. Bronchial epithelial cells, smooth muscle cells and
fibroblasts are all involved in fibrogenesis and have high EGFR,
EGF and TGF-α gene expression. Alveolar epithelial injury is
deemed to be the initial step in the development of pulmonary
fibrosis (PF). It has been demonstrated that IL-1β-induced epithelial
repair occurs through EGF- and TGF-α-dependent pathways. EGF treatment
in mice was also found to dramatically increase pulmonary expression
of surfactant protein C (SP-C), in which pathogenic mutations have
been discovered in ILD patients. EGF also enhances fibronectin
synthesis in fibrotic human lung fibroblasts. Recently, studies in
animal models have also established that both EGFR and TGF-α
are genetically involved in the development of ILD. Mouse models
with either constitutive or conditional expression of TGF-α in lung
epithelial cells were shown to develop PF phenotypes. Conversely,
when PF was induced with bleomycin, TGF-α knockout
mice had significantly reduced fibrotic phenotypes compared with
wild-type mice. Furthermore, PF was prevented in a bitransgenic
mouse model with constitutive expression of TGF-α but genetically
disrupted EGFR. These findings collectively suggest that activation
of the EGFR pathway is critical for the pathogenesis of ILD. However,
this apparently contradicts the observation that inhibition of EGFR
induces ILD in humans. Similarly conflicting findings have been
reported in animal models with regard to the relationship between
EGFRi treatment and PF phenotypes. Gefitinib was found to prevent
bleomycin-induced lung fibrosis and to diminish TGF-α
overexpression-induced pulmonary fibrosis in mice. In contrast, other
studies found that gefitinib augments pneumonitis and exacerbates
bleomycin-induced lung fibrosis. Why both excessive activation and
inhibition of EGFR cause ILD remains an unanswered question, which
reflects a poor understanding of the function of EGFR in lung
fibrosis. It should be noted that most current studies of EGFR in ILD
are based on animal models or have used very few human samples,
and many were performed in vitro. Since multiple cell types are
involved in ILD, detailed knowledge of EGFR signaling among those
cells remains very limited. Nevertheless, the evidence accumulated
so far consistently suggests an important role of EGFR genes
in both the pathophysiology and the genetics of the disease. It is
thus reasonable that genetic variation in the EGFR pathway may at
least in part contribute to the natural history of ILD. Interestingly,
a recent GWAS observed that two polymorphisms (rs79842896 and
rs76795398) close to (~33 kb downstream of) the EGFR gene were
among the top-ranked associations with IPF (P ≤ 10^[exponent illegible]),
although this association did not reach genome-wide significance.
Continued study of more EGFR polymorphisms is therefore necessary
to further clarify the role of EGFR variation in ILD.

The polymorphisms tested in this study have been shown to be
functional in determining the expression or activity of EGFR or its
ligands in both our own and others' studies. We have demonstrated that
the two EGFR polymorphisms -216G/T and -191C/A were associated
with increased EGFR promoter activity and gene/protein
expression. They were also shown to be associated with drug-induced
toxicities, e.g. skin rash, during EGFR inhibitor treatment in our
previous study. A SNP at EGFR codon 497 results in an Arg (R) to
Lys (K) substitution, which has been associated with decreased EGFR
activity. The TGFA intronic polymorphism rs3821262C/T was
also found to be associated with TGFA gene expression as well as
with sensitivity to EGFR inhibitors in cancer cell lines in our previous
study. The EGF 61A/G polymorphism was initially
demonstrated to affect EGF protein expression, with the G allele
associated with a higher EGF level than the A allele. A recent
meta-analysis of 41 case-control studies on various cancers has
shown that the G allele was significantly associated with increased
cancer risk. It is thus possible that the relatively lower EGF level
associated with the A allele underlies its association with ILD in
our study. This supports the notion that relatively lower EGFR-axis
activity might be a risk factor for ILD, which
is consistent with the observation that inhibition of EGFR signaling
induces ILD.

It should be further noted that previous GWAS did not identify
this locus. This may be because most previous GWAS focused on IPF rather
than ILD as a phenotype, or because of population differences:
many loci identified in GWAS are commonly observed to have
different effect sizes in different populations.
Nevertheless, without independent validation, a false positive result
for the association observed in our study cannot be excluded,
particularly because our study was limited by its relatively small sample
size. Therefore, our findings should be interpreted with
caution.

Conclusions 

Our study provides a new investigation of the relationship between
functional EGFR pathway gene polymorphisms and the risk of ILD. The
findings suggest a possible association between the EGF 61A/G
polymorphism and ILD. Further validation of this genetic association in
independent sample sets is warranted.